Dataset statistics
| Number of variables | 21 |
|---|---|
| Number of observations | 193404 |
| Missing cells | 555626 |
| Missing cells (%) | 13.7% |
| Duplicate rows | 256 |
| Duplicate rows (%) | 0.1% |
| Total size in memory | 101.6 MiB |
| Average record size in memory | 551.1 B |
Variable types
| NUM | 11 |
|---|---|
| CAT | 9 |
| BOOL | 1 |
| Dataset has 256 (0.1%) duplicate rows | Duplicates |
batsman has a high cardinality: 546 distinct values | High cardinality |
non_striker has a high cardinality: 541 distinct values | High cardinality |
bowler has a high cardinality: 437 distinct values | High cardinality |
player_dismissed has a high cardinality: 507 distinct values | High cardinality |
fielder has a high cardinality: 507 distinct values | High cardinality |
total_runs is highly correlated with batsman_runs | High correlation |
batsman_runs is highly correlated with total_runs | High correlation |
player_dismissed has 184357 (95.3%) missing values | Missing |
dismissal_kind has 184357 (95.3%) missing values | Missing |
fielder has 186906 (96.6%) missing values | Missing |
bye_runs is highly skewed (γ1 = 30.46379722) | Skewed |
noball_runs is highly skewed (γ1 = 34.78726606) | Skewed |
wide_runs has 187537 (97.0%) zeros | Zeros |
bye_runs has 192898 (99.7%) zeros | Zeros |
legbye_runs has 190292 (98.4%) zeros | Zeros |
noball_runs has 192640 (99.6%) zeros | Zeros |
batsman_runs has 76062 (39.3%) zeros | Zeros |
extra_runs has 183154 (94.7%) zeros | Zeros |
total_runs has 67527 (34.9%) zeros | Zeros |
Reproduction
| Analysis started | 2020-12-21 07:05:58.719678 |
|---|---|
| Analysis finished | 2020-12-21 07:07:16.679575 |
| Duration | 1 minute and 17.96 seconds |
| Software version | pandas-profiling v2.9.0 |
| Download configuration | config.yaml |
match_id
Real number (ℝ≥0)
| Distinct | 816 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 91884.46912 |
|---|---|
| Minimum | 1 |
| Maximum | 1237181 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 41 |
| Q1 | 205 |
| median | 408 |
| Q3 | 613 |
| 95-th percentile | 1216511 |
| Maximum | 1237181 |
| Range | 1237180 |
| Interquartile range (IQR) | 408 |
Descriptive statistics
| Standard deviation | 318512.9064 |
|---|---|
| Coefficient of variation (CV) | 3.466449874 |
| Kurtosis | 8.579006802 |
| Mean | 91884.46912 |
| Median Absolute Deviation (MAD) | 204 |
| Skewness | 3.252235282 |
| Sum | 1.777082387e+10 |
| Variance | 1.014504715e+11 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 1216498 | 466 | 0.2% | |
| 126 | 267 | 0.1% | |
| 34 | 263 | 0.1% | |
| 534 | 262 | 0.1% | |
| 476 | 262 | 0.1% | |
| 388 | 261 | 0.1% | |
| 570 | 259 | 0.1% | |
| 190 | 259 | 0.1% | |
| 536 | 258 | 0.1% | |
| 401 | 258 | 0.1% | |
| 11146 | 257 | 0.1% | |
| 1216517 | 257 | 0.1% | |
| 211 | 257 | 0.1% | |
| 257 | 257 | 0.1% | |
| 11339 | 257 | 0.1% | |
| 516 | 257 | 0.1% | |
| 367 | 256 | 0.1% | |
| 567 | 256 | 0.1% | |
| 153 | 255 | 0.1% | |
| 50 | 255 | 0.1% | |
| 539 | 255 | 0.1% | |
| 553 | 255 | 0.1% | |
| 196 | 255 | 0.1% | |
| 67 | 255 | 0.1% | |
| 488 | 255 | 0.1% | |
| Other values (791) | 186750 | 96.6% |
| Value | Count | Frequency (%) | |
| 1 | 248 | 0.1% | |
| 2 | 247 | 0.1% | |
| 3 | 218 | 0.1% | |
| 4 | 247 | 0.1% | |
| 5 | 248 | 0.1% | |
| 6 | 216 | 0.1% | |
| 7 | 254 | 0.1% | |
| 8 | 212 | 0.1% | |
| 9 | 226 | 0.1% | |
| 10 | 239 | 0.1% |
| Value | Count | Frequency (%) | |
| 1237181 | 235 | 0.1% | |
| 1237180 | 250 | 0.1% | |
| 1237178 | 247 | 0.1% | |
| 1237177 | 247 | 0.1% | |
| 1216547 | 251 | 0.1% | |
| 1216546 | 235 | 0.1% | |
| 1216545 | 230 | 0.1% | |
| 1216544 | 234 | 0.1% | |
| 1216543 | 242 | 0.1% | |
| 1216542 | 216 | 0.1% |
inning
Real number (ℝ≥0)
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.483159604 |
|---|---|
| Minimum | 1 |
| Maximum | 5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 2 |
| Maximum | 5 |
| Range | 4 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.5019063714 |
|---|---|
| Coefficient of variation (CV) | 0.3384034801 |
| Kurtosis | -1.768127923 |
| Mean | 1.483159604 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.1105838407 |
| Sum | 286849 |
| Variance | 0.2519100057 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 1 | 100109 | 51.8% | |
| 2 | 93199 | 48.2% | |
| 3 | 50 | < 0.1% | |
| 4 | 38 | < 0.1% | |
| 5 | 8 | < 0.1% |
| Value | Count | Frequency (%) | |
| 1 | 100109 | 51.8% | |
| 2 | 93199 | 48.2% | |
| 3 | 50 | < 0.1% | |
| 4 | 38 | < 0.1% | |
| 5 | 8 | < 0.1% |
| Value | Count | Frequency (%) | |
| 5 | 8 | < 0.1% | |
| 4 | 38 | < 0.1% | |
| 3 | 50 | < 0.1% | |
| 2 | 93199 | 48.2% | |
| 1 | 100109 | 51.8% |
batting_team
Categorical
| Distinct | 23 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| Mumbai Indians | |
|---|---|
| Kings XI Punjab | |
| Royal Challengers Bangalore | |
| Kolkata Knight Riders | |
| Chennai Super Kings | |
| Other values (18) |
| Value | Count | Frequency (%) | |
| Mumbai Indians | 22619 | 11.7% | |
| Kings XI Punjab | 20931 | 10.8% | |
| Royal Challengers Bangalore | 20908 | 10.8% | |
| Kolkata Knight Riders | 20858 | 10.8% | |
| Chennai Super Kings | 19762 | 10.2% | |
| Delhi Daredevils | 18786 | 9.7% | |
| Rajasthan Royals | 17292 | 8.9% | |
| Sunrisers Hyderabad | 12908 | 6.7% | |
| Deccan Chargers | 9034 | 4.7% | |
| Pune Warriors | 5443 | 2.8% | |
| Gujarat Lions | 3566 | 1.8% | |
| DC | 2042 | 1.1% | |
| SRH | 1988 | 1.0% | |
| Delhi Capitals | 1909 | 1.0% | |
| Rising Pune Supergiant | 1900 | 1.0% | |
| MI | 1829 | 0.9% | |
| KXIP | 1785 | 0.9% | |
| RCB | 1772 | 0.9% | |
| CSK | 1651 | 0.9% | |
| KKR | 1650 | 0.9% | |
| RR | 1609 | 0.8% | |
| Kochi Tuskers Kerala | 1582 | 0.8% | |
| Rising Pune Supergiants | 1580 | 0.8% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 27 |
|---|---|
| Median length | 16 |
| Mean length | 16.85362764 |
| Min length | 2 |
Most occurring characters
| Value | Count | Frequency (%) | |
| a | 366154 | 11.2% | |
| n | 267743 | 8.2% | |
| 266599 | 8.2% | ||
| e | 240824 | 7.4% | |
| i | 222738 | 6.8% | |
| s | 212440 | 6.5% | |
| r | 184553 | 5.7% | |
| l | 164754 | 5.1% | |
| g | 119361 | 3.7% | |
| h | 110131 | 3.4% | |
| u | 93771 | 2.9% | |
| K | 92309 | 2.8% | |
| o | 90557 | 2.8% | |
| R | 88458 | 2.7% | |
| d | 88079 | 2.7% | |
| t | 67963 | 2.1% | |
| C | 57078 | 1.8% | |
| b | 56458 | 1.7% | |
| y | 51108 | 1.6% | |
| D | 50557 | 1.6% | |
| I | 47164 | 1.4% | |
| j | 41789 | 1.3% | |
| S | 39789 | 1.2% | |
| P | 31639 | 1.0% | |
| p | 25151 | 0.8% | |
| Other values (12) | 182392 | 5.6% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 2487069 | 76.3% | |
| Uppercase Letter | 505891 | 15.5% | |
| Space Separator | 266599 | 8.2% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| K | 92309 | 18.2% | |
| R | 88458 | 17.5% | |
| C | 57078 | 11.3% | |
| D | 50557 | 10.0% | |
| I | 47164 | 9.3% | |
| S | 39789 | 7.9% | |
| P | 31639 | 6.3% | |
| M | 24448 | 4.8% | |
| X | 22716 | 4.5% | |
| B | 22680 | 4.5% | |
| H | 14896 | 2.9% | |
| W | 5443 | 1.1% | |
| G | 3566 | 0.7% | |
| L | 3566 | 0.7% | |
| T | 1582 | 0.3% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| a | 366154 | 14.7% | |
| n | 267743 | 10.8% | |
| e | 240824 | 9.7% | |
| i | 222738 | 9.0% | |
| s | 212440 | 8.5% | |
| r | 184553 | 7.4% | |
| l | 164754 | 6.6% | |
| g | 119361 | 4.8% | |
| h | 110131 | 4.4% | |
| u | 93771 | 3.8% | |
| o | 90557 | 3.6% | |
| d | 88079 | 3.5% | |
| t | 67963 | 2.7% | |
| b | 56458 | 2.3% | |
| y | 51108 | 2.1% | |
| j | 41789 | 1.7% | |
| p | 25151 | 1.0% | |
| m | 22619 | 0.9% | |
| k | 22440 | 0.9% | |
| c | 19650 | 0.8% | |
| v | 18786 | 0.8% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 266599 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 2992960 | 91.8% | |
| Common | 266599 | 8.2% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| a | 366154 | 12.2% | |
| n | 267743 | 8.9% | |
| e | 240824 | 8.0% | |
| i | 222738 | 7.4% | |
| s | 212440 | 7.1% | |
| r | 184553 | 6.2% | |
| l | 164754 | 5.5% | |
| g | 119361 | 4.0% | |
| h | 110131 | 3.7% | |
| u | 93771 | 3.1% | |
| K | 92309 | 3.1% | |
| o | 90557 | 3.0% | |
| R | 88458 | 3.0% | |
| d | 88079 | 2.9% | |
| t | 67963 | 2.3% | |
| C | 57078 | 1.9% | |
| b | 56458 | 1.9% | |
| y | 51108 | 1.7% | |
| D | 50557 | 1.7% | |
| I | 47164 | 1.6% | |
| j | 41789 | 1.4% | |
| S | 39789 | 1.3% | |
| P | 31639 | 1.1% | |
| p | 25151 | 0.8% | |
| M | 24448 | 0.8% | |
| Other values (11) | 157944 | 5.3% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 266599 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 3259559 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| a | 366154 | 11.2% | |
| n | 267743 | 8.2% | |
| 266599 | 8.2% | ||
| e | 240824 | 7.4% | |
| i | 222738 | 6.8% | |
| s | 212440 | 6.5% | |
| r | 184553 | 5.7% | |
| l | 164754 | 5.1% | |
| g | 119361 | 3.7% | |
| h | 110131 | 3.4% | |
| u | 93771 | 2.9% | |
| K | 92309 | 2.8% | |
| o | 90557 | 2.8% | |
| R | 88458 | 2.7% | |
| d | 88079 | 2.7% | |
| t | 67963 | 2.1% | |
| C | 57078 | 1.8% | |
| b | 56458 | 1.7% | |
| y | 51108 | 1.6% | |
| D | 50557 | 1.6% | |
| I | 47164 | 1.4% | |
| j | 41789 | 1.3% | |
| S | 39789 | 1.2% | |
| P | 31639 | 1.0% | |
| p | 25151 | 0.8% | |
| Other values (12) | 182392 | 5.6% |
bowling_team
Categorical
| Distinct | 23 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| Mumbai Indians | |
|---|---|
| Royal Challengers Bangalore | |
| Kolkata Knight Riders | |
| Kings XI Punjab | |
| Chennai Super Kings | |
| Other values (18) |
| Value | Count | Frequency (%) | |
| Mumbai Indians | 22517 | 11.6% | |
| Royal Challengers Bangalore | 21236 | 11.0% | |
| Kolkata Knight Riders | 20940 | 10.8% | |
| Kings XI Punjab | 20782 | 10.7% | |
| Chennai Super Kings | 19556 | 10.1% | |
| Delhi Daredevils | 18725 | 9.7% | |
| Rajasthan Royals | 17382 | 9.0% | |
| Sunrisers Hyderabad | 12779 | 6.6% | |
| Deccan Chargers | 9039 | 4.7% | |
| Pune Warriors | 5457 | 2.8% | |
| Gujarat Lions | 3545 | 1.8% | |
| SRH | 1999 | 1.0% | |
| DC | 1996 | 1.0% | |
| Delhi Capitals | 1963 | 1.0% | |
| Rising Pune Supergiant | 1928 | 1.0% | |
| MI | 1899 | 1.0% | |
| RCB | 1759 | 0.9% | |
| KXIP | 1753 | 0.9% | |
| RR | 1688 | 0.9% | |
| CSK | 1619 | 0.8% | |
| Rising Pune Supergiants | 1615 | 0.8% | |
| Kochi Tuskers Kerala | 1614 | 0.8% | |
| KKR | 1613 | 0.8% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 27 |
|---|---|
| Median length | 16 |
| Mean length | 16.87265517 |
| Min length | 2 |
Most occurring characters
| Value | Count | Frequency (%) | |
| a | 367329 | 11.3% | |
| n | 267509 | 8.2% | |
| 266749 | 8.2% | ||
| e | 241305 | 7.4% | |
| i | 222208 | 6.8% | |
| s | 212468 | 6.5% | |
| r | 184795 | 5.7% | |
| l | 166256 | 5.1% | |
| g | 119875 | 3.7% | |
| h | 110455 | 3.4% | |
| u | 93336 | 2.9% | |
| K | 92044 | 2.8% | |
| o | 91410 | 2.8% | |
| R | 89230 | 2.7% | |
| d | 87740 | 2.7% | |
| t | 68313 | 2.1% | |
| C | 57168 | 1.8% | |
| b | 56078 | 1.7% | |
| y | 51397 | 1.6% | |
| D | 50448 | 1.5% | |
| I | 46951 | 1.4% | |
| j | 41709 | 1.3% | |
| S | 39496 | 1.2% | |
| P | 31535 | 1.0% | |
| p | 25062 | 0.8% | |
| Other values (12) | 182373 | 5.6% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 2490733 | 76.3% | |
| Uppercase Letter | 505757 | 15.5% | |
| Space Separator | 266749 | 8.2% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| K | 92044 | 18.2% | |
| R | 89230 | 17.6% | |
| C | 57168 | 11.3% | |
| D | 50448 | 10.0% | |
| I | 46951 | 9.3% | |
| S | 39496 | 7.8% | |
| P | 31535 | 6.2% | |
| M | 24416 | 4.8% | |
| B | 22995 | 4.5% | |
| X | 22535 | 4.5% | |
| H | 14778 | 2.9% | |
| W | 5457 | 1.1% | |
| G | 3545 | 0.7% | |
| L | 3545 | 0.7% | |
| T | 1614 | 0.3% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| a | 367329 | 14.7% | |
| n | 267509 | 10.7% | |
| e | 241305 | 9.7% | |
| i | 222208 | 8.9% | |
| s | 212468 | 8.5% | |
| r | 184795 | 7.4% | |
| l | 166256 | 6.7% | |
| g | 119875 | 4.8% | |
| h | 110455 | 4.4% | |
| u | 93336 | 3.7% | |
| o | 91410 | 3.7% | |
| d | 87740 | 3.5% | |
| t | 68313 | 2.7% | |
| b | 56078 | 2.3% | |
| y | 51397 | 2.1% | |
| j | 41709 | 1.7% | |
| p | 25062 | 1.0% | |
| k | 22554 | 0.9% | |
| m | 22517 | 0.9% | |
| c | 19692 | 0.8% | |
| v | 18725 | 0.8% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 266749 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 2996490 | 91.8% | |
| Common | 266749 | 8.2% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| a | 367329 | 12.3% | |
| n | 267509 | 8.9% | |
| e | 241305 | 8.1% | |
| i | 222208 | 7.4% | |
| s | 212468 | 7.1% | |
| r | 184795 | 6.2% | |
| l | 166256 | 5.5% | |
| g | 119875 | 4.0% | |
| h | 110455 | 3.7% | |
| u | 93336 | 3.1% | |
| K | 92044 | 3.1% | |
| o | 91410 | 3.1% | |
| R | 89230 | 3.0% | |
| d | 87740 | 2.9% | |
| t | 68313 | 2.3% | |
| C | 57168 | 1.9% | |
| b | 56078 | 1.9% | |
| y | 51397 | 1.7% | |
| D | 50448 | 1.7% | |
| I | 46951 | 1.6% | |
| j | 41709 | 1.4% | |
| S | 39496 | 1.3% | |
| P | 31535 | 1.1% | |
| p | 25062 | 0.8% | |
| M | 24416 | 0.8% | |
| Other values (11) | 157957 | 5.3% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 266749 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 3263239 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| a | 367329 | 11.3% | |
| n | 267509 | 8.2% | |
| 266749 | 8.2% | ||
| e | 241305 | 7.4% | |
| i | 222208 | 6.8% | |
| s | 212468 | 6.5% | |
| r | 184795 | 5.7% | |
| l | 166256 | 5.1% | |
| g | 119875 | 3.7% | |
| h | 110455 | 3.4% | |
| u | 93336 | 2.9% | |
| K | 92044 | 2.8% | |
| o | 91410 | 2.8% | |
| R | 89230 | 2.7% | |
| d | 87740 | 2.7% | |
| t | 68313 | 2.1% | |
| C | 57168 | 1.8% | |
| b | 56078 | 1.7% | |
| y | 51397 | 1.6% | |
| D | 50448 | 1.5% | |
| I | 46951 | 1.4% | |
| j | 41709 | 1.3% | |
| S | 39496 | 1.2% | |
| P | 31535 | 1.0% | |
| p | 25062 | 0.8% | |
| Other values (12) | 182373 | 5.6% |
over
Real number (ℝ≥0)
| Distinct | 20 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10.16503795 |
|---|---|
| Minimum | 1 |
| Maximum | 20 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 5 |
| median | 10 |
| Q3 | 15 |
| 95-th percentile | 19 |
| Maximum | 20 |
| Range | 19 |
| Interquartile range (IQR) | 10 |
Descriptive statistics
| Standard deviation | 5.678812928 |
|---|---|
| Coefficient of variation (CV) | 0.5586612618 |
| Kurtosis | -1.183427499 |
| Mean | 10.16503795 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | 0.04821941158 |
| Sum | 1965959 |
| Variance | 32.24891627 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 1 | 10396 | 5.4% | |
| 2 | 10251 | 5.3% | |
| 3 | 10153 | 5.2% | |
| 4 | 10112 | 5.2% | |
| 5 | 10084 | 5.2% | |
| 6 | 10064 | 5.2% | |
| 7 | 10020 | 5.2% | |
| 8 | 9993 | 5.2% | |
| 9 | 9958 | 5.1% | |
| 10 | 9914 | 5.1% | |
| 11 | 9854 | 5.1% | |
| 12 | 9822 | 5.1% | |
| 13 | 9803 | 5.1% | |
| 14 | 9698 | 5.0% | |
| 15 | 9623 | 5.0% | |
| 16 | 9459 | 4.9% | |
| 17 | 9345 | 4.8% | |
| 18 | 9061 | 4.7% | |
| 19 | 8497 | 4.4% | |
| 20 | 7297 | 3.8% |
| Value | Count | Frequency (%) | |
| 1 | 10396 | 5.4% | |
| 2 | 10251 | 5.3% | |
| 3 | 10153 | 5.2% | |
| 4 | 10112 | 5.2% | |
| 5 | 10084 | 5.2% | |
| 6 | 10064 | 5.2% | |
| 7 | 10020 | 5.2% | |
| 8 | 9993 | 5.2% | |
| 9 | 9958 | 5.1% | |
| 10 | 9914 | 5.1% |
| Value | Count | Frequency (%) | |
| 20 | 7297 | 3.8% | |
| 19 | 8497 | 4.4% | |
| 18 | 9061 | 4.7% | |
| 17 | 9345 | 4.8% | |
| 16 | 9459 | 4.9% | |
| 15 | 9623 | 5.0% | |
| 14 | 9698 | 5.0% | |
| 13 | 9803 | 5.1% | |
| 12 | 9822 | 5.1% | |
| 11 | 9854 | 5.1% |
ball
Real number (ℝ≥0)
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.615638767 |
|---|---|
| Minimum | 1 |
| Maximum | 10 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 4 |
| Q3 | 5 |
| 95-th percentile | 6 |
| Maximum | 10 |
| Range | 9 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 1.807174232 |
|---|---|
| Coefficient of variation (CV) | 0.4998215664 |
| Kurtosis | -1.081527294 |
| Mean | 3.615638767 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.09665574213 |
| Sum | 699279 |
| Variance | 3.265878704 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 1 | 31377 | 16.2% | |
| 2 | 31272 | 16.2% | |
| 3 | 31187 | 16.1% | |
| 4 | 31124 | 16.1% | |
| 5 | 31014 | 16.0% | |
| 6 | 30915 | 16.0% | |
| 7 | 5515 | 2.9% | |
| 8 | 865 | 0.4% | |
| 9 | 134 | 0.1% | |
| 10 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 1 | 31377 | 16.2% | |
| 2 | 31272 | 16.2% | |
| 3 | 31187 | 16.1% | |
| 4 | 31124 | 16.1% | |
| 5 | 31014 | 16.0% | |
| 6 | 30915 | 16.0% | |
| 7 | 5515 | 2.9% | |
| 8 | 865 | 0.4% | |
| 9 | 134 | 0.1% | |
| 10 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 10 | 1 | < 0.1% | |
| 9 | 134 | 0.1% | |
| 8 | 865 | 0.4% | |
| 7 | 5515 | 2.9% | |
| 6 | 30915 | 16.0% | |
| 5 | 31014 | 16.0% | |
| 4 | 31124 | 16.1% | |
| 3 | 31187 | 16.1% | |
| 2 | 31272 | 16.2% | |
| 1 | 31377 | 16.2% |
| Distinct | 546 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| V Kohli | 4604 |
|---|---|
| S Dhawan | 4201 |
| RG Sharma | 4081 |
| SK Raina | 4044 |
| DA Warner | 3831 |
| Other values (541) |
| Value | Count | Frequency (%) | |
| V Kohli | 4604 | 2.4% | |
| S Dhawan | 4201 | 2.2% | |
| RG Sharma | 4081 | 2.1% | |
| SK Raina | 4044 | 2.1% | |
| DA Warner | 3831 | 2.0% | |
| RV Uthappa | 3652 | 1.9% | |
| G Gambhir | 3524 | 1.8% | |
| MS Dhoni | 3487 | 1.8% | |
| CH Gayle | 3373 | 1.7% | |
| AM Rahane | 3326 | 1.7% | |
| AB de Villiers | 3265 | 1.7% | |
| KD Karthik | 3020 | 1.6% | |
| AT Rayudu | 2964 | 1.5% | |
| SR Watson | 2889 | 1.5% | |
| MK Pandey | 2794 | 1.4% | |
| PA Patel | 2444 | 1.3% | |
| YK Pathan | 2334 | 1.2% | |
| JH Kallis | 2291 | 1.2% | |
| BB McCullum | 2272 | 1.2% | |
| Yuvraj Singh | 2207 | 1.1% | |
| M Vijay | 2206 | 1.1% | |
| KA Pollard | 2118 | 1.1% | |
| SR Tendulkar | 2044 | 1.1% | |
| KL Rahul | 2016 | 1.0% | |
| SV Samson | 1962 | 1.0% | |
| Other values (521) | 118455 | 61.2% |
Unique
| Unique | 13 ? |
|---|---|
| Unique (%) | < 0.1% |
Length
| Max length | 20 |
|---|---|
| Median length | 9 |
| Mean length | 9.317325391 |
| Min length | 5 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 203137 | 11.3% | ||
| a | 199495 | 11.1% | |
| i | 87774 | 4.9% | |
| n | 83911 | 4.7% | |
| h | 81454 | 4.5% | |
| r | 77592 | 4.3% | |
| S | 73581 | 4.1% | |
| e | 73196 | 4.1% | |
| l | 68487 | 3.8% | |
| s | 48318 | 2.7% | |
| R | 47367 | 2.6% | |
| A | 45074 | 2.5% | |
| M | 44989 | 2.5% | |
| K | 44415 | 2.5% | |
| o | 41260 | 2.3% | |
| t | 40479 | 2.2% | |
| d | 39106 | 2.2% | |
| P | 38129 | 2.1% | |
| u | 37540 | 2.1% | |
| D | 37280 | 2.1% | |
| y | 34696 | 1.9% | |
| m | 30832 | 1.7% | |
| J | 26184 | 1.5% | |
| G | 25532 | 1.4% | |
| V | 24413 | 1.4% | |
| Other values (29) | 247767 | 13.7% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 1050493 | 58.3% | |
| Uppercase Letter | 548146 | 30.4% | |
| Space Separator | 203137 | 11.3% | |
| Dash Punctuation | 232 | < 0.1% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| S | 73581 | 13.4% | |
| R | 47367 | 8.6% | |
| A | 45074 | 8.2% | |
| M | 44989 | 8.2% | |
| K | 44415 | 8.1% | |
| P | 38129 | 7.0% | |
| D | 37280 | 6.8% | |
| J | 26184 | 4.8% | |
| G | 25532 | 4.7% | |
| V | 24413 | 4.5% | |
| B | 21705 | 4.0% | |
| C | 21452 | 3.9% | |
| H | 20025 | 3.7% | |
| T | 15762 | 2.9% | |
| W | 11882 | 2.2% | |
| L | 11059 | 2.0% | |
| Y | 8196 | 1.5% | |
| N | 8112 | 1.5% | |
| E | 5512 | 1.0% | |
| F | 5044 | 0.9% | |
| U | 4444 | 0.8% | |
| I | 4176 | 0.8% | |
| O | 2012 | 0.4% | |
| Q | 1600 | 0.3% | |
| Z | 184 | < 0.1% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 203137 | 100.0% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| a | 199495 | 19.0% | |
| i | 87774 | 8.4% | |
| n | 83911 | 8.0% | |
| h | 81454 | 7.8% | |
| r | 77592 | 7.4% | |
| e | 73196 | 7.0% | |
| l | 68487 | 6.5% | |
| s | 48318 | 4.6% | |
| o | 41260 | 3.9% | |
| t | 40479 | 3.9% | |
| d | 39106 | 3.7% | |
| u | 37540 | 3.6% | |
| y | 34696 | 3.3% | |
| m | 30832 | 2.9% | |
| w | 18045 | 1.7% | |
| g | 17824 | 1.7% | |
| k | 17093 | 1.6% | |
| p | 12682 | 1.2% | |
| j | 9803 | 0.9% | |
| v | 9193 | 0.9% | |
| c | 9154 | 0.9% | |
| b | 8480 | 0.8% | |
| x | 1290 | 0.1% | |
| f | 986 | 0.1% | |
| q | 937 | 0.1% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 232 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 1598639 | 88.7% | |
| Common | 203369 | 11.3% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| a | 199495 | 12.5% | |
| i | 87774 | 5.5% | |
| n | 83911 | 5.2% | |
| h | 81454 | 5.1% | |
| r | 77592 | 4.9% | |
| S | 73581 | 4.6% | |
| e | 73196 | 4.6% | |
| l | 68487 | 4.3% | |
| s | 48318 | 3.0% | |
| R | 47367 | 3.0% | |
| A | 45074 | 2.8% | |
| M | 44989 | 2.8% | |
| K | 44415 | 2.8% | |
| o | 41260 | 2.6% | |
| t | 40479 | 2.5% | |
| d | 39106 | 2.4% | |
| P | 38129 | 2.4% | |
| u | 37540 | 2.3% | |
| D | 37280 | 2.3% | |
| y | 34696 | 2.2% | |
| m | 30832 | 1.9% | |
| J | 26184 | 1.6% | |
| G | 25532 | 1.6% | |
| V | 24413 | 1.5% | |
| B | 21705 | 1.4% | |
| Other values (27) | 225830 | 14.1% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 203137 | 99.9% | ||
| - | 232 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 1802008 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 203137 | 11.3% | ||
| a | 199495 | 11.1% | |
| i | 87774 | 4.9% | |
| n | 83911 | 4.7% | |
| h | 81454 | 4.5% | |
| r | 77592 | 4.3% | |
| S | 73581 | 4.1% | |
| e | 73196 | 4.1% | |
| l | 68487 | 3.8% | |
| s | 48318 | 2.7% | |
| R | 47367 | 2.6% | |
| A | 45074 | 2.5% | |
| M | 44989 | 2.5% | |
| K | 44415 | 2.5% | |
| o | 41260 | 2.3% | |
| t | 40479 | 2.2% | |
| d | 39106 | 2.2% | |
| P | 38129 | 2.1% | |
| u | 37540 | 2.1% | |
| D | 37280 | 2.1% | |
| y | 34696 | 1.9% | |
| m | 30832 | 1.7% | |
| J | 26184 | 1.5% | |
| G | 25532 | 1.4% | |
| V | 24413 | 1.4% | |
| Other values (29) | 247767 | 13.7% |
| Distinct | 541 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 6 |
| Missing (%) | < 0.1% |
| Memory size | 1.5 MiB |
| S Dhawan | 4601 |
|---|---|
| V Kohli | 4455 |
| SK Raina | 4173 |
| RG Sharma | 4132 |
| G Gambhir | 3740 |
| Other values (536) |
| Value | Count | Frequency (%) | |
| S Dhawan | 4601 | 2.4% | |
| V Kohli | 4455 | 2.3% | |
| SK Raina | 4173 | 2.2% | |
| RG Sharma | 4132 | 2.1% | |
| G Gambhir | 3740 | 1.9% | |
| AM Rahane | 3573 | 1.8% | |
| DA Warner | 3553 | 1.8% | |
| RV Uthappa | 3543 | 1.8% | |
| AB de Villiers | 3239 | 1.7% | |
| CH Gayle | 3193 | 1.7% | |
| MS Dhoni | 3159 | 1.6% | |
| AT Rayudu | 3089 | 1.6% | |
| KD Karthik | 3061 | 1.6% | |
| MK Pandey | 2934 | 1.5% | |
| SR Watson | 2737 | 1.4% | |
| PA Patel | 2608 | 1.3% | |
| SR Tendulkar | 2427 | 1.3% | |
| BB McCullum | 2356 | 1.2% | |
| JH Kallis | 2333 | 1.2% | |
| M Vijay | 2296 | 1.2% | |
| YK Pathan | 2165 | 1.1% | |
| KL Rahul | 2005 | 1.0% | |
| Yuvraj Singh | 1985 | 1.0% | |
| F du Plessis | 1974 | 1.0% | |
| SV Samson | 1969 | 1.0% | |
| Other values (516) | 118098 | 61.1% |
Unique
| Unique | 7 ? |
|---|---|
| Unique (%) | < 0.1% |
Length
| Max length | 20 |
|---|---|
| Median length | 9 |
| Mean length | 9.318571488 |
| Min length | 3 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 203150 | 11.3% | ||
| a | 200959 | 11.2% | |
| i | 87485 | 4.9% | |
| n | 83835 | 4.7% | |
| h | 81400 | 4.5% | |
| r | 77396 | 4.3% | |
| e | 73885 | 4.1% | |
| S | 73791 | 4.1% | |
| l | 67981 | 3.8% | |
| s | 48047 | 2.7% | |
| R | 47383 | 2.6% | |
| M | 45674 | 2.5% | |
| A | 44856 | 2.5% | |
| K | 44433 | 2.5% | |
| o | 39817 | 2.2% | |
| t | 39731 | 2.2% | |
| d | 39713 | 2.2% | |
| u | 38006 | 2.1% | |
| P | 37775 | 2.1% | |
| D | 36696 | 2.0% | |
| y | 35253 | 2.0% | |
| m | 30823 | 1.7% | |
| J | 26195 | 1.5% | |
| G | 25883 | 1.4% | |
| V | 24571 | 1.4% | |
| Other values (29) | 247511 | 13.7% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 1050911 | 58.3% | |
| Uppercase Letter | 547936 | 30.4% | |
| Space Separator | 203150 | 11.3% | |
| Dash Punctuation | 252 | < 0.1% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| S | 73791 | 13.5% | |
| R | 47383 | 8.6% | |
| M | 45674 | 8.3% | |
| A | 44856 | 8.2% | |
| K | 44433 | 8.1% | |
| P | 37775 | 6.9% | |
| D | 36696 | 6.7% | |
| J | 26195 | 4.8% | |
| G | 25883 | 4.7% | |
| V | 24571 | 4.5% | |
| B | 21639 | 3.9% | |
| C | 21151 | 3.9% | |
| H | 20104 | 3.7% | |
| T | 16462 | 3.0% | |
| W | 11484 | 2.1% | |
| L | 10978 | 2.0% | |
| N | 7948 | 1.5% | |
| Y | 7673 | 1.4% | |
| E | 5639 | 1.0% | |
| F | 5077 | 0.9% | |
| I | 4409 | 0.8% | |
| U | 4352 | 0.8% | |
| O | 1995 | 0.4% | |
| Q | 1545 | 0.3% | |
| Z | 206 | < 0.1% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 203150 | 100.0% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| a | 200959 | 19.1% | |
| i | 87485 | 8.3% | |
| n | 83835 | 8.0% | |
| h | 81400 | 7.7% | |
| r | 77396 | 7.4% | |
| e | 73885 | 7.0% | |
| l | 67981 | 6.5% | |
| s | 48047 | 4.6% | |
| o | 39817 | 3.8% | |
| t | 39731 | 3.8% | |
| d | 39713 | 3.8% | |
| u | 38006 | 3.6% | |
| y | 35253 | 3.4% | |
| m | 30823 | 2.9% | |
| w | 18395 | 1.8% | |
| g | 17669 | 1.7% | |
| k | 17434 | 1.7% | |
| p | 12658 | 1.2% | |
| j | 9669 | 0.9% | |
| c | 9052 | 0.9% | |
| v | 8863 | 0.8% | |
| b | 8854 | 0.8% | |
| x | 1226 | 0.1% | |
| q | 1053 | 0.1% | |
| f | 879 | 0.1% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 252 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 1598847 | 88.7% | |
| Common | 203402 | 11.3% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| a | 200959 | 12.6% | |
| i | 87485 | 5.5% | |
| n | 83835 | 5.2% | |
| h | 81400 | 5.1% | |
| r | 77396 | 4.8% | |
| e | 73885 | 4.6% | |
| S | 73791 | 4.6% | |
| l | 67981 | 4.3% | |
| s | 48047 | 3.0% | |
| R | 47383 | 3.0% | |
| M | 45674 | 2.9% | |
| A | 44856 | 2.8% | |
| K | 44433 | 2.8% | |
| o | 39817 | 2.5% | |
| t | 39731 | 2.5% | |
| d | 39713 | 2.5% | |
| u | 38006 | 2.4% | |
| P | 37775 | 2.4% | |
| D | 36696 | 2.3% | |
| y | 35253 | 2.2% | |
| m | 30823 | 1.9% | |
| J | 26195 | 1.6% | |
| G | 25883 | 1.6% | |
| V | 24571 | 1.5% | |
| B | 21639 | 1.4% | |
| Other values (27) | 225620 | 14.1% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 203150 | 99.9% | ||
| - | 252 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 1802249 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 203150 | 11.3% | ||
| a | 200959 | 11.2% | |
| i | 87485 | 4.9% | |
| n | 83835 | 4.7% | |
| h | 81400 | 4.5% | |
| r | 77396 | 4.3% | |
| e | 73885 | 4.1% | |
| S | 73791 | 4.1% | |
| l | 67981 | 3.8% | |
| s | 48047 | 2.7% | |
| R | 47383 | 2.6% | |
| M | 45674 | 2.5% | |
| A | 44856 | 2.5% | |
| K | 44433 | 2.5% | |
| o | 39817 | 2.2% | |
| t | 39731 | 2.2% | |
| d | 39713 | 2.2% | |
| u | 38006 | 2.1% | |
| P | 37775 | 2.1% | |
| D | 36696 | 2.0% | |
| y | 35253 | 2.0% | |
| m | 30823 | 1.7% | |
| J | 26195 | 1.5% | |
| G | 25883 | 1.4% | |
| V | 24571 | 1.4% | |
| Other values (29) | 247511 | 13.7% |
| Distinct | 437 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| Harbhajan Singh | 3451 |
|---|---|
| R Ashwin | 3320 |
| PP Chawla | 3279 |
| A Mishra | 3230 |
| SL Malinga | 2974 |
| Other values (432) |
| Value | Count | Frequency (%) | |
| Harbhajan Singh | 3451 | 1.8% | |
| R Ashwin | 3320 | 1.7% | |
| PP Chawla | 3279 | 1.7% | |
| A Mishra | 3230 | 1.7% | |
| SL Malinga | 2974 | 1.5% | |
| DJ Bravo | 2839 | 1.5% | |
| SP Narine | 2826 | 1.5% | |
| RA Jadeja | 2758 | 1.4% | |
| P Kumar | 2721 | 1.4% | |
| B Kumar | 2707 | 1.4% | |
| UT Yadav | 2650 | 1.4% | |
| DW Steyn | 2281 | 1.2% | |
| Z Khan | 2276 | 1.2% | |
| R Vinay Kumar | 2186 | 1.1% | |
| YS Chahal | 2175 | 1.1% | |
| JJ Bumrah | 2165 | 1.1% | |
| SR Watson | 2137 | 1.1% | |
| IK Pathan | 2113 | 1.1% | |
| I Sharma | 1999 | 1.0% | |
| A Nehra | 1974 | 1.0% | |
| PP Ojha | 1945 | 1.0% | |
| AR Patel | 1894 | 1.0% | |
| Sandeep Sharma | 1878 | 1.0% | |
| RP Singh | 1874 | 1.0% | |
| DS Kulkarni | 1850 | 1.0% | |
| Other values (412) | 131902 | 68.2% |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Length
| Max length | 17 |
|---|---|
| Median length | 9 |
| Mean length | 9.495832558 |
| Min length | 5 |
Most occurring characters
| Value | Count | Frequency (%) | |
| a | 235044 | 12.8% | |
| 199837 | 10.9% | ||
| n | 98214 | 5.3% | |
| r | 97807 | 5.3% | |
| h | 95673 | 5.2% | |
| i | 81532 | 4.4% | |
| e | 77948 | 4.2% | |
| S | 71221 | 3.9% | |
| l | 58103 | 3.2% | |
| M | 47437 | 2.6% | |
| A | 45060 | 2.5% | |
| o | 44573 | 2.4% | |
| t | 43127 | 2.3% | |
| P | 43087 | 2.3% | |
| m | 42874 | 2.3% | |
| s | 40755 | 2.2% | |
| d | 39436 | 2.1% | |
| K | 37577 | 2.0% | |
| u | 37385 | 2.0% | |
| R | 36308 | 2.0% | |
| J | 34139 | 1.9% | |
| B | 26551 | 1.4% | |
| D | 23728 | 1.3% | |
| g | 22417 | 1.2% | |
| C | 22012 | 1.2% | |
| Other values (31) | 234687 | 12.8% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 1124651 | 61.2% | |
| Uppercase Letter | 511221 | 27.8% | |
| Space Separator | 199837 | 10.9% | |
| Dash Punctuation | 748 | < 0.1% | |
| Open Punctuation | 25 | < 0.1% | |
| Decimal Number | 25 | < 0.1% | |
| Close Punctuation | 25 | < 0.1% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| S | 71221 | 13.9% | |
| M | 47437 | 9.3% | |
| A | 45060 | 8.8% | |
| P | 43087 | 8.4% | |
| K | 37577 | 7.4% | |
| R | 36308 | 7.1% | |
| J | 34139 | 6.7% | |
| B | 26551 | 5.2% | |
| D | 23728 | 4.6% | |
| C | 22012 | 4.3% | |
| H | 18707 | 3.7% | |
| T | 17299 | 3.4% | |
| N | 14986 | 2.9% | |
| L | 11162 | 2.2% | |
| V | 11069 | 2.2% | |
| W | 10026 | 2.0% | |
| G | 9036 | 1.8% | |
| Y | 8960 | 1.8% | |
| I | 7227 | 1.4% | |
| U | 5510 | 1.1% | |
| F | 2877 | 0.6% | |
| O | 2672 | 0.5% | |
| Z | 2636 | 0.5% | |
| E | 1854 | 0.4% | |
| Q | 80 | < 0.1% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 199837 | 100.0% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| a | 235044 | 20.9% | |
| n | 98214 | 8.7% | |
| r | 97807 | 8.7% | |
| h | 95673 | 8.5% | |
| i | 81532 | 7.2% | |
| e | 77948 | 6.9% | |
| l | 58103 | 5.2% | |
| o | 44573 | 4.0% | |
| t | 43127 | 3.8% | |
| m | 42874 | 3.8% | |
| s | 40755 | 3.6% | |
| d | 39436 | 3.5% | |
| u | 37385 | 3.3% | |
| g | 22417 | 2.0% | |
| k | 19810 | 1.8% | |
| y | 15899 | 1.4% | |
| j | 14073 | 1.3% | |
| w | 14055 | 1.2% | |
| v | 13735 | 1.2% | |
| b | 10491 | 0.9% | |
| p | 8925 | 0.8% | |
| c | 6030 | 0.5% | |
| f | 2366 | 0.2% | |
| z | 1962 | 0.2% | |
| q | 1861 | 0.2% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 748 | 100.0% |
Most frequent Open Punctuation characters
| Value | Count | Frequency (%) | |
| ( | 25 | 100.0% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 2 | 25 | 100.0% |
Most frequent Close Punctuation characters
| Value | Count | Frequency (%) | |
| ) | 25 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 1635872 | 89.1% | |
| Common | 200660 | 10.9% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| a | 235044 | 14.4% | |
| n | 98214 | 6.0% | |
| r | 97807 | 6.0% | |
| h | 95673 | 5.8% | |
| i | 81532 | 5.0% | |
| e | 77948 | 4.8% | |
| S | 71221 | 4.4% | |
| l | 58103 | 3.6% | |
| M | 47437 | 2.9% | |
| A | 45060 | 2.8% | |
| o | 44573 | 2.7% | |
| t | 43127 | 2.6% | |
| P | 43087 | 2.6% | |
| m | 42874 | 2.6% | |
| s | 40755 | 2.5% | |
| d | 39436 | 2.4% | |
| K | 37577 | 2.3% | |
| u | 37385 | 2.3% | |
| R | 36308 | 2.2% | |
| J | 34139 | 2.1% | |
| B | 26551 | 1.6% | |
| D | 23728 | 1.5% | |
| g | 22417 | 1.4% | |
| C | 22012 | 1.3% | |
| k | 19810 | 1.2% | |
| Other values (26) | 214054 | 13.1% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 199837 | 99.6% | ||
| - | 748 | 0.4% | |
| ( | 25 | < 0.1% | |
| 2 | 25 | < 0.1% | |
| ) | 25 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 1836532 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| a | 235044 | 12.8% | |
| 199837 | 10.9% | ||
| n | 98214 | 5.3% | |
| r | 97807 | 5.3% | |
| h | 95673 | 5.2% | |
| i | 81532 | 4.4% | |
| e | 77948 | 4.2% | |
| S | 71221 | 3.9% | |
| l | 58103 | 3.2% | |
| M | 47437 | 2.6% | |
| A | 45060 | 2.5% | |
| o | 44573 | 2.4% | |
| t | 43127 | 2.3% | |
| P | 43087 | 2.3% | |
| m | 42874 | 2.3% | |
| s | 40755 | 2.2% | |
| d | 39436 | 2.1% | |
| K | 37577 | 2.0% | |
| u | 37385 | 2.0% | |
| R | 36308 | 2.0% | |
| J | 34139 | 1.9% | |
| B | 26551 | 1.4% | |
| D | 23728 | 1.3% | |
| g | 22417 | 1.2% | |
| C | 22012 | 1.2% | |
| Other values (31) | 234687 | 12.8% |
is_super_over
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| 0 | |
|---|---|
| 1 | 125 |
| Value | Count | Frequency (%) | |
| 0 | 193279 | 99.9% | |
| 1 | 125 | 0.1% |
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.036488387 |
|---|---|
| Minimum | 0 |
| Maximum | 5 |
| Zeros | 187537 |
| Zeros (%) | 97.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.2473992732 |
|---|---|
| Coefficient of variation (CV) | 6.780219503 |
| Kurtosis | 191.7849123 |
| Mean | 0.036488387 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 11.58720402 |
| Sum | 7057 |
| Variance | 0.06120640038 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 187537 | 97.0% | |
| 1 | 5368 | 2.8% | |
| 2 | 235 | 0.1% | |
| 5 | 211 | 0.1% | |
| 3 | 48 | < 0.1% | |
| 4 | 5 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 187537 | 97.0% | |
| 1 | 5368 | 2.8% | |
| 2 | 235 | 0.1% | |
| 3 | 48 | < 0.1% | |
| 4 | 5 | < 0.1% | |
| 5 | 211 | 0.1% |
| Value | Count | Frequency (%) | |
| 5 | 211 | 0.1% | |
| 4 | 5 | < 0.1% | |
| 3 | 48 | < 0.1% | |
| 2 | 235 | 0.1% | |
| 1 | 5368 | 2.8% | |
| 0 | 187537 | 97.0% |
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.004767223015 |
|---|---|
| Minimum | 0 |
| Maximum | 4 |
| Zeros | 192898 |
| Zeros (%) | 99.7% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 4 |
| Range | 4 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.1140485029 |
|---|---|
| Coefficient of variation (CV) | 23.92346708 |
| Kurtosis | 1016.235729 |
| Mean | 0.004767223015 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 30.46379722 |
| Sum | 922 |
| Variance | 0.01300706101 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 192898 | 99.7% | |
| 1 | 346 | 0.2% | |
| 4 | 127 | 0.1% | |
| 2 | 31 | < 0.1% | |
| 3 | 2 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 192898 | 99.7% | |
| 1 | 346 | 0.2% | |
| 2 | 31 | < 0.1% | |
| 3 | 2 | < 0.1% | |
| 4 | 127 | 0.1% |
| Value | Count | Frequency (%) | |
| 4 | 127 | 0.1% | |
| 3 | 2 | < 0.1% | |
| 2 | 31 | < 0.1% | |
| 1 | 346 | 0.2% | |
| 0 | 192898 | 99.7% |
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.02079584704 |
|---|---|
| Minimum | 0 |
| Maximum | 5 |
| Zeros | 190292 |
| Zeros (%) | 98.4% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.1936457844 |
|---|---|
| Coefficient of variation (CV) | 9.31175268 |
| Kurtosis | 245.4605614 |
| Mean | 0.02079584704 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 13.8793515 |
| Sum | 4022 |
| Variance | 0.0374986898 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 190292 | 98.4% | |
| 1 | 2703 | 1.4% | |
| 4 | 234 | 0.1% | |
| 2 | 150 | 0.1% | |
| 3 | 21 | < 0.1% | |
| 5 | 4 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 190292 | 98.4% | |
| 1 | 2703 | 1.4% | |
| 2 | 150 | 0.1% | |
| 3 | 21 | < 0.1% | |
| 4 | 234 | 0.1% | |
| 5 | 4 | < 0.1% |
| Value | Count | Frequency (%) | |
| 5 | 4 | < 0.1% | |
| 4 | 234 | 0.1% | |
| 3 | 21 | < 0.1% | |
| 2 | 150 | 0.1% | |
| 1 | 2703 | 1.4% | |
| 0 | 190292 | 98.4% |
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.004544890488 |
|---|---|
| Minimum | 0 |
| Maximum | 7 |
| Zeros | 192640 |
| Zeros (%) | 99.6% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 7 |
| Range | 7 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.08523291428 |
|---|---|
| Coefficient of variation (CV) | 18.75356832 |
| Kurtosis | 1941.740098 |
| Mean | 0.004544890488 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 34.78726606 |
| Sum | 879 |
| Variance | 0.007264649676 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 192640 | 99.6% | |
| 1 | 717 | 0.4% | |
| 2 | 23 | < 0.1% | |
| 5 | 14 | < 0.1% | |
| 3 | 6 | < 0.1% | |
| 7 | 4 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 192640 | 99.6% | |
| 1 | 717 | 0.4% | |
| 2 | 23 | < 0.1% | |
| 3 | 6 | < 0.1% | |
| 5 | 14 | < 0.1% | |
| 7 | 4 | < 0.1% |
| Value | Count | Frequency (%) | |
| 7 | 4 | < 0.1% | |
| 5 | 14 | < 0.1% | |
| 3 | 6 | < 0.1% | |
| 2 | 23 | < 0.1% | |
| 1 | 717 | 0.4% | |
| 0 | 192640 | 99.6% |
penalty_runs
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| 0 | |
|---|---|
| 5 | 2 |
| Value | Count | Frequency (%) | |
| 0 | 193402 | > 99.9% | |
| 5 | 2 | < 0.1% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 0 | 193402 | > 99.9% | |
| 5 | 2 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 193404 | 100.0% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 0 | 193402 | > 99.9% | |
| 5 | 2 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 193404 | 100.0% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 0 | 193402 | > 99.9% | |
| 5 | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 193404 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 0 | 193402 | > 99.9% | |
| 5 | 2 | < 0.1% |
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.251375359 |
|---|---|
| Minimum | 0 |
| Maximum | 7 |
| Zeros | 76062 |
| Zeros (%) | 39.3% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 4 |
| Maximum | 7 |
| Range | 7 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.609800822 |
|---|---|
| Coefficient of variation (CV) | 1.28642522 |
| Kurtosis | 1.626350135 |
| Mean | 1.251375359 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.58054888 |
| Sum | 242021 |
| Variance | 2.591458686 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 76062 | 39.3% | |
| 1 | 73215 | 37.9% | |
| 4 | 21997 | 11.4% | |
| 2 | 12496 | 6.5% | |
| 6 | 8906 | 4.6% | |
| 3 | 636 | 0.3% | |
| 5 | 81 | < 0.1% | |
| 7 | 11 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 76062 | 39.3% | |
| 1 | 73215 | 37.9% | |
| 2 | 12496 | 6.5% | |
| 3 | 636 | 0.3% | |
| 4 | 21997 | 11.4% | |
| 5 | 81 | < 0.1% | |
| 6 | 8906 | 4.6% | |
| 7 | 11 | < 0.1% |
| Value | Count | Frequency (%) | |
| 7 | 11 | < 0.1% | |
| 6 | 8906 | 4.6% | |
| 5 | 81 | < 0.1% | |
| 4 | 21997 | 11.4% | |
| 3 | 636 | 0.3% | |
| 2 | 12496 | 6.5% | |
| 1 | 73215 | 37.9% | |
| 0 | 76062 | 39.3% |
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.06664805278 |
|---|---|
| Minimum | 0 |
| Maximum | 7 |
| Zeros | 183154 |
| Zeros (%) | 94.7% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 7 |
| Range | 7 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.3416196934 |
|---|---|
| Coefficient of variation (CV) | 5.125726547 |
| Kurtosis | 93.48815611 |
| Mean | 0.06664805278 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 8.304922399 |
| Sum | 12890 |
| Variance | 0.1167040149 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 183154 | 94.7% | |
| 1 | 9134 | 4.7% | |
| 2 | 438 | 0.2% | |
| 4 | 366 | 0.2% | |
| 5 | 230 | 0.1% | |
| 3 | 77 | < 0.1% | |
| 7 | 5 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 183154 | 94.7% | |
| 1 | 9134 | 4.7% | |
| 2 | 438 | 0.2% | |
| 3 | 77 | < 0.1% | |
| 4 | 366 | 0.2% | |
| 5 | 230 | 0.1% | |
| 7 | 5 | < 0.1% |
| Value | Count | Frequency (%) | |
| 7 | 5 | < 0.1% | |
| 5 | 230 | 0.1% | |
| 4 | 366 | 0.2% | |
| 3 | 77 | < 0.1% | |
| 2 | 438 | 0.2% | |
| 1 | 9134 | 4.7% | |
| 0 | 183154 | 94.7% |
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.318023412 |
|---|---|
| Minimum | 0 |
| Maximum | 10 |
| Zeros | 67527 |
| Zeros (%) | 34.9% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 4 |
| Maximum | 10 |
| Range | 10 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.60605852 |
|---|---|
| Coefficient of variation (CV) | 1.218535654 |
| Kurtosis | 1.631821194 |
| Mean | 1.318023412 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.555653332 |
| Sum | 254911 |
| Variance | 2.57942397 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 1 | 79356 | 41.0% | |
| 0 | 67527 | 34.9% | |
| 4 | 22218 | 11.5% | |
| 2 | 14192 | 7.3% | |
| 6 | 8884 | 4.6% | |
| 3 | 748 | 0.4% | |
| 5 | 357 | 0.2% | |
| 8 | 64 | < 0.1% | |
| 7 | 42 | < 0.1% | |
| 10 | 16 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 67527 | 34.9% | |
| 1 | 79356 | 41.0% | |
| 2 | 14192 | 7.3% | |
| 3 | 748 | 0.4% | |
| 4 | 22218 | 11.5% | |
| 5 | 357 | 0.2% | |
| 6 | 8884 | 4.6% | |
| 7 | 42 | < 0.1% | |
| 8 | 64 | < 0.1% | |
| 10 | 16 | < 0.1% |
| Value | Count | Frequency (%) | |
| 10 | 16 | < 0.1% | |
| 8 | 64 | < 0.1% | |
| 7 | 42 | < 0.1% | |
| 6 | 8884 | 4.6% | |
| 5 | 357 | 0.2% | |
| 4 | 22218 | 11.5% | |
| 3 | 748 | 0.4% | |
| 2 | 14192 | 7.3% | |
| 1 | 79356 | 41.0% | |
| 0 | 67527 | 34.9% |
| Distinct | 507 |
|---|---|
| Distinct (%) | 5.6% |
| Missing | 184357 |
| Missing (%) | 95.3% |
| Memory size | 1.5 MiB |
| SK Raina | 162 |
|---|---|
| RG Sharma | 159 |
| RV Uthappa | 155 |
| V Kohli | 145 |
| S Dhawan | 143 |
| Other values (502) |
| Value | Count | Frequency (%) | |
| SK Raina | 162 | 0.1% | |
| RG Sharma | 159 | 0.1% | |
| RV Uthappa | 155 | 0.1% | |
| V Kohli | 145 | 0.1% | |
| S Dhawan | 143 | 0.1% | |
| KD Karthik | 140 | 0.1% | |
| G Gambhir | 136 | 0.1% | |
| PA Patel | 126 | 0.1% | |
| SR Watson | 121 | 0.1% | |
| AM Rahane | 121 | 0.1% | |
| AT Rayudu | 118 | 0.1% | |
| DA Warner | 114 | 0.1% | |
| AB de Villiers | 113 | 0.1% | |
| CH Gayle | 112 | 0.1% | |
| Yuvraj Singh | 111 | 0.1% | |
| YK Pathan | 110 | 0.1% | |
| MS Dhoni | 107 | 0.1% | |
| BB McCullum | 104 | 0.1% | |
| KA Pollard | 101 | 0.1% | |
| MK Pandey | 100 | 0.1% | |
| V Sehwag | 99 | 0.1% | |
| M Vijay | 99 | 0.1% | |
| JH Kallis | 85 | < 0.1% | |
| DR Smith | 84 | < 0.1% | |
| SV Samson | 82 | < 0.1% | |
| Other values (482) | 6100 | 3.2% | |
| (Missing) | 184357 | 95.3% |
Unique
| Unique | 91 ? |
|---|---|
| Unique (%) | 1.0% |
Length
| Max length | 20 |
|---|---|
| Median length | 3 |
| Mean length | 3.296948357 |
| Min length | 3 |
Most occurring characters
| Value | Count | Frequency (%) | |
| n | 372631 | 58.4% | |
| a | 193988 | 30.4% | |
| 9481 | 1.5% | ||
| i | 4044 | 0.6% | |
| h | 3968 | 0.6% | |
| r | 3717 | 0.6% | |
| e | 3412 | 0.5% | |
| S | 3358 | 0.5% | |
| l | 3039 | 0.5% | |
| A | 2183 | 0.3% | |
| R | 2176 | 0.3% | |
| M | 2170 | 0.3% | |
| s | 2110 | 0.3% | |
| K | 1996 | 0.3% | |
| t | 1982 | 0.3% | |
| o | 1919 | 0.3% | |
| P | 1871 | 0.3% | |
| d | 1781 | 0.3% | |
| u | 1731 | 0.3% | |
| D | 1587 | 0.2% | |
| y | 1473 | 0.2% | |
| m | 1446 | 0.2% | |
| J | 1274 | 0.2% | |
| V | 1097 | 0.2% | |
| G | 1086 | 0.2% | |
| Other values (29) | 12123 | 1.9% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 602596 | 94.5% | |
| Uppercase Letter | 25543 | 4.0% | |
| Space Separator | 9481 | 1.5% | |
| Dash Punctuation | 23 | < 0.1% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| n | 372631 | 61.8% | |
| a | 193988 | 32.2% | |
| i | 4044 | 0.7% | |
| h | 3968 | 0.7% | |
| r | 3717 | 0.6% | |
| e | 3412 | 0.6% | |
| l | 3039 | 0.5% | |
| s | 2110 | 0.4% | |
| t | 1982 | 0.3% | |
| o | 1919 | 0.3% | |
| d | 1781 | 0.3% | |
| u | 1731 | 0.3% | |
| y | 1473 | 0.2% | |
| m | 1446 | 0.2% | |
| g | 987 | 0.2% | |
| w | 870 | 0.1% | |
| k | 808 | 0.1% | |
| p | 603 | 0.1% | |
| j | 526 | 0.1% | |
| v | 467 | 0.1% | |
| c | 448 | 0.1% | |
| b | 389 | 0.1% | |
| x | 80 | < 0.1% | |
| f | 74 | < 0.1% | |
| z | 60 | < 0.1% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| S | 3358 | 13.1% | |
| A | 2183 | 8.5% | |
| R | 2176 | 8.5% | |
| M | 2170 | 8.5% | |
| K | 1996 | 7.8% | |
| P | 1871 | 7.3% | |
| D | 1587 | 6.2% | |
| J | 1274 | 5.0% | |
| V | 1097 | 4.3% | |
| G | 1086 | 4.3% | |
| B | 1080 | 4.2% | |
| C | 1052 | 4.1% | |
| H | 885 | 3.5% | |
| T | 768 | 3.0% | |
| L | 527 | 2.1% | |
| W | 513 | 2.0% | |
| N | 485 | 1.9% | |
| Y | 409 | 1.6% | |
| F | 222 | 0.9% | |
| U | 221 | 0.9% | |
| E | 205 | 0.8% | |
| I | 178 | 0.7% | |
| O | 119 | 0.5% | |
| Q | 61 | 0.2% | |
| Z | 19 | 0.1% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 9481 | 100.0% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 23 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 628139 | 98.5% | |
| Common | 9504 | 1.5% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| n | 372631 | 59.3% | |
| a | 193988 | 30.9% | |
| i | 4044 | 0.6% | |
| h | 3968 | 0.6% | |
| r | 3717 | 0.6% | |
| e | 3412 | 0.5% | |
| S | 3358 | 0.5% | |
| l | 3039 | 0.5% | |
| A | 2183 | 0.3% | |
| R | 2176 | 0.3% | |
| M | 2170 | 0.3% | |
| s | 2110 | 0.3% | |
| K | 1996 | 0.3% | |
| t | 1982 | 0.3% | |
| o | 1919 | 0.3% | |
| P | 1871 | 0.3% | |
| d | 1781 | 0.3% | |
| u | 1731 | 0.3% | |
| D | 1587 | 0.3% | |
| y | 1473 | 0.2% | |
| m | 1446 | 0.2% | |
| J | 1274 | 0.2% | |
| V | 1097 | 0.2% | |
| G | 1086 | 0.2% | |
| B | 1080 | 0.2% | |
| Other values (27) | 11020 | 1.8% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 9481 | 99.8% | ||
| - | 23 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 637643 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| n | 372631 | 58.4% | |
| a | 193988 | 30.4% | |
| 9481 | 1.5% | ||
| i | 4044 | 0.6% | |
| h | 3968 | 0.6% | |
| r | 3717 | 0.6% | |
| e | 3412 | 0.5% | |
| S | 3358 | 0.5% | |
| l | 3039 | 0.5% | |
| A | 2183 | 0.3% | |
| R | 2176 | 0.3% | |
| M | 2170 | 0.3% | |
| s | 2110 | 0.3% | |
| K | 1996 | 0.3% | |
| t | 1982 | 0.3% | |
| o | 1919 | 0.3% | |
| P | 1871 | 0.3% | |
| d | 1781 | 0.3% | |
| u | 1731 | 0.3% | |
| D | 1587 | 0.2% | |
| y | 1473 | 0.2% | |
| m | 1446 | 0.2% | |
| J | 1274 | 0.2% | |
| V | 1097 | 0.2% | |
| G | 1086 | 0.2% | |
| Other values (29) | 12123 | 1.9% |
| Distinct | 9 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 184357 |
| Missing (%) | 95.3% |
| Memory size | 1.5 MiB |
| caught | |
|---|---|
| bowled | |
| run out | |
| lbw | |
| stumped | 279 |
| Other values (4) | 237 |
| Value | Count | Frequency (%) | |
| caught | 5393 | 2.8% | |
| bowled | 1710 | 0.9% | |
| run out | 856 | 0.4% | |
| lbw | 572 | 0.3% | |
| stumped | 279 | 0.1% | |
| caught and bowled | 211 | 0.1% | |
| retired hurt | 12 | < 0.1% | |
| hit wicket | 12 | < 0.1% | |
| obstructing the field | 2 | < 0.1% | |
| (Missing) | 184357 | 95.3% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 21 |
|---|---|
| Median length | 3 |
| Mean length | 3.150105479 |
| Min length | 3 |
Most occurring characters
| Value | Count | Frequency (%) | |
| n | 369783 | 60.7% | |
| a | 190172 | 31.2% | |
| u | 7609 | 1.2% | |
| t | 6793 | 1.1% | |
| h | 5630 | 0.9% | |
| c | 5618 | 0.9% | |
| g | 5606 | 0.9% | |
| o | 2779 | 0.5% | |
| w | 2505 | 0.4% | |
| b | 2495 | 0.4% | |
| l | 2495 | 0.4% | |
| d | 2425 | 0.4% | |
| e | 2240 | 0.4% | |
| 1306 | 0.2% | ||
| r | 894 | 0.1% | |
| s | 281 | < 0.1% | |
| m | 279 | < 0.1% | |
| p | 279 | < 0.1% | |
| i | 40 | < 0.1% | |
| k | 12 | < 0.1% | |
| f | 2 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 607937 | 99.8% | |
| Space Separator | 1306 | 0.2% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| n | 369783 | 60.8% | |
| a | 190172 | 31.3% | |
| u | 7609 | 1.3% | |
| t | 6793 | 1.1% | |
| h | 5630 | 0.9% | |
| c | 5618 | 0.9% | |
| g | 5606 | 0.9% | |
| o | 2779 | 0.5% | |
| w | 2505 | 0.4% | |
| b | 2495 | 0.4% | |
| l | 2495 | 0.4% | |
| d | 2425 | 0.4% | |
| e | 2240 | 0.4% | |
| r | 894 | 0.1% | |
| s | 281 | < 0.1% | |
| m | 279 | < 0.1% | |
| p | 279 | < 0.1% | |
| i | 40 | < 0.1% | |
| k | 12 | < 0.1% | |
| f | 2 | < 0.1% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 1306 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 607937 | 99.8% | |
| Common | 1306 | 0.2% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| n | 369783 | 60.8% | |
| a | 190172 | 31.3% | |
| u | 7609 | 1.3% | |
| t | 6793 | 1.1% | |
| h | 5630 | 0.9% | |
| c | 5618 | 0.9% | |
| g | 5606 | 0.9% | |
| o | 2779 | 0.5% | |
| w | 2505 | 0.4% | |
| b | 2495 | 0.4% | |
| l | 2495 | 0.4% | |
| d | 2425 | 0.4% | |
| e | 2240 | 0.4% | |
| r | 894 | 0.1% | |
| s | 281 | < 0.1% | |
| m | 279 | < 0.1% | |
| p | 279 | < 0.1% | |
| i | 40 | < 0.1% | |
| k | 12 | < 0.1% | |
| f | 2 | < 0.1% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 1306 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 609243 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| n | 369783 | 60.7% | |
| a | 190172 | 31.2% | |
| u | 7609 | 1.2% | |
| t | 6793 | 1.1% | |
| h | 5630 | 0.9% | |
| c | 5618 | 0.9% | |
| g | 5606 | 0.9% | |
| o | 2779 | 0.5% | |
| w | 2505 | 0.4% | |
| b | 2495 | 0.4% | |
| l | 2495 | 0.4% | |
| d | 2425 | 0.4% | |
| e | 2240 | 0.4% | |
| 1306 | 0.2% | ||
| r | 894 | 0.1% | |
| s | 281 | < 0.1% | |
| m | 279 | < 0.1% | |
| p | 279 | < 0.1% | |
| i | 40 | < 0.1% | |
| k | 12 | < 0.1% | |
| f | 2 | < 0.1% |
| Distinct | 507 |
|---|---|
| Distinct (%) | 7.8% |
| Missing | 186906 |
| Missing (%) | 96.6% |
| Memory size | 1.5 MiB |
| MS Dhoni | 159 |
|---|---|
| KD Karthik | 152 |
| RV Uthappa | 125 |
| AB de Villiers | 117 |
| SK Raina | 115 |
| Other values (502) |
| Value | Count | Frequency (%) | |
| MS Dhoni | 159 | 0.1% | |
| KD Karthik | 152 | 0.1% | |
| RV Uthappa | 125 | 0.1% | |
| AB de Villiers | 117 | 0.1% | |
| SK Raina | 115 | 0.1% | |
| PA Patel | 97 | 0.1% | |
| RG Sharma | 93 | < 0.1% | |
| V Kohli | 90 | < 0.1% | |
| KA Pollard | 85 | < 0.1% | |
| WP Saha | 84 | < 0.1% | |
| NV Ojha | 82 | < 0.1% | |
| RA Jadeja | 80 | < 0.1% | |
| MK Pandey | 78 | < 0.1% | |
| DJ Bravo | 78 | < 0.1% | |
| AC Gilchrist | 75 | < 0.1% | |
| S Dhawan | 74 | < 0.1% | |
| AM Rahane | 66 | < 0.1% | |
| AT Rayudu | 65 | < 0.1% | |
| DA Warner | 64 | < 0.1% | |
| KC Sangakkara | 58 | < 0.1% | |
| SV Samson | 58 | < 0.1% | |
| DA Miller | 53 | < 0.1% | |
| SPD Smith | 51 | < 0.1% | |
| YK Pathan | 51 | < 0.1% | |
| BB McCullum | 50 | < 0.1% | |
| Other values (482) | 4398 | 2.3% | |
| (Missing) | 186906 | 96.6% |
Unique
| Unique | 97 ? |
|---|---|
| Unique (%) | 1.5% |
Length
| Max length | 21 |
|---|---|
| Median length | 3 |
| Mean length | 3.217193026 |
| Min length | 3 |
Most occurring characters
| Value | Count | Frequency (%) | |
| n | 376561 | 60.5% | |
| a | 193968 | 31.2% | |
| 6931 | 1.1% | ||
| i | 3059 | 0.5% | |
| h | 2995 | 0.5% | |
| r | 2696 | 0.4% | |
| e | 2443 | 0.4% | |
| S | 2368 | 0.4% | |
| l | 2173 | 0.3% | |
| K | 1604 | 0.3% | |
| M | 1568 | 0.3% | |
| t | 1563 | 0.3% | |
| A | 1542 | 0.2% | |
| s | 1527 | 0.2% | |
| R | 1496 | 0.2% | |
| o | 1441 | 0.2% | |
| P | 1381 | 0.2% | |
| d | 1375 | 0.2% | |
| u | 1252 | 0.2% | |
| D | 1244 | 0.2% | |
| m | 971 | 0.2% | |
| J | 917 | 0.1% | |
| B | 848 | 0.1% | |
| y | 842 | 0.1% | |
| V | 795 | 0.1% | |
| Other values (30) | 8658 | 1.4% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 596869 | 95.9% | |
| Uppercase Letter | 18254 | 2.9% | |
| Space Separator | 6931 | 1.1% | |
| Open Punctuation | 76 | < 0.1% | |
| Close Punctuation | 76 | < 0.1% | |
| Dash Punctuation | 12 | < 0.1% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| n | 376561 | 63.1% | |
| a | 193968 | 32.5% | |
| i | 3059 | 0.5% | |
| h | 2995 | 0.5% | |
| r | 2696 | 0.5% | |
| e | 2443 | 0.4% | |
| l | 2173 | 0.4% | |
| t | 1563 | 0.3% | |
| s | 1527 | 0.3% | |
| o | 1441 | 0.2% | |
| d | 1375 | 0.2% | |
| u | 1252 | 0.2% | |
| m | 971 | 0.2% | |
| y | 842 | 0.1% | |
| k | 748 | 0.1% | |
| g | 628 | 0.1% | |
| w | 592 | 0.1% | |
| p | 439 | 0.1% | |
| j | 390 | 0.1% | |
| v | 386 | 0.1% | |
| b | 327 | 0.1% | |
| c | 316 | 0.1% | |
| f | 57 | < 0.1% | |
| q | 44 | < 0.1% | |
| z | 40 | < 0.1% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| S | 2368 | 13.0% | |
| K | 1604 | 8.8% | |
| M | 1568 | 8.6% | |
| A | 1542 | 8.4% | |
| R | 1496 | 8.2% | |
| P | 1381 | 7.6% | |
| D | 1244 | 6.8% | |
| J | 917 | 5.0% | |
| B | 848 | 4.6% | |
| V | 795 | 4.4% | |
| C | 660 | 3.6% | |
| G | 592 | 3.2% | |
| H | 581 | 3.2% | |
| T | 547 | 3.0% | |
| W | 364 | 2.0% | |
| N | 341 | 1.9% | |
| L | 324 | 1.8% | |
| Y | 296 | 1.6% | |
| U | 207 | 1.1% | |
| F | 138 | 0.8% | |
| I | 132 | 0.7% | |
| E | 120 | 0.7% | |
| O | 118 | 0.6% | |
| Q | 46 | 0.3% | |
| Z | 25 | 0.1% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 6931 | 100.0% |
Most frequent Open Punctuation characters
| Value | Count | Frequency (%) | |
| ( | 76 | 100.0% |
Most frequent Close Punctuation characters
| Value | Count | Frequency (%) | |
| ) | 76 | 100.0% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 12 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 615123 | 98.9% | |
| Common | 7095 | 1.1% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| n | 376561 | 61.2% | |
| a | 193968 | 31.5% | |
| i | 3059 | 0.5% | |
| h | 2995 | 0.5% | |
| r | 2696 | 0.4% | |
| e | 2443 | 0.4% | |
| S | 2368 | 0.4% | |
| l | 2173 | 0.4% | |
| K | 1604 | 0.3% | |
| M | 1568 | 0.3% | |
| t | 1563 | 0.3% | |
| A | 1542 | 0.3% | |
| s | 1527 | 0.2% | |
| R | 1496 | 0.2% | |
| o | 1441 | 0.2% | |
| P | 1381 | 0.2% | |
| d | 1375 | 0.2% | |
| u | 1252 | 0.2% | |
| D | 1244 | 0.2% | |
| m | 971 | 0.2% | |
| J | 917 | 0.1% | |
| B | 848 | 0.1% | |
| y | 842 | 0.1% | |
| V | 795 | 0.1% | |
| k | 748 | 0.1% | |
| Other values (26) | 7746 | 1.3% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 6931 | 97.7% | ||
| ( | 76 | 1.1% | |
| ) | 76 | 1.1% | |
| - | 12 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 622218 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| n | 376561 | 60.5% | |
| a | 193968 | 31.2% | |
| 6931 | 1.1% | ||
| i | 3059 | 0.5% | |
| h | 2995 | 0.5% | |
| r | 2696 | 0.4% | |
| e | 2443 | 0.4% | |
| S | 2368 | 0.4% | |
| l | 2173 | 0.3% | |
| K | 1604 | 0.3% | |
| M | 1568 | 0.3% | |
| t | 1563 | 0.3% | |
| A | 1542 | 0.2% | |
| s | 1527 | 0.2% | |
| R | 1496 | 0.2% | |
| o | 1441 | 0.2% | |
| P | 1381 | 0.2% | |
| d | 1375 | 0.2% | |
| u | 1252 | 0.2% | |
| D | 1244 | 0.2% | |
| m | 971 | 0.2% | |
| J | 917 | 0.1% | |
| B | 848 | 0.1% | |
| y | 842 | 0.1% | |
| V | 795 | 0.1% | |
| Other values (30) | 8658 | 1.4% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| match_id | inning | batting_team | bowling_team | over | ball | batsman | non_striker | bowler | is_super_over | wide_runs | bye_runs | legbye_runs | noball_runs | penalty_runs | batsman_runs | extra_runs | total_runs | player_dismissed | dismissal_kind | fielder | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | 1 | Sunrisers Hyderabad | Royal Challengers Bangalore | 1 | 1 | DA Warner | S Dhawan | TS Mills | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | NaN | NaN | NaN |
| 1 | 1 | 1 | Sunrisers Hyderabad | Royal Challengers Bangalore | 1 | 2 | DA Warner | S Dhawan | TS Mills | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | NaN | NaN | NaN |
| 2 | 1 | 1 | Sunrisers Hyderabad | Royal Challengers Bangalore | 1 | 3 | DA Warner | S Dhawan | TS Mills | 0 | 0 | 0 | 0 | 0 | 0 | 4 | 0 | 4 | NaN | NaN | NaN |
| 3 | 1 | 1 | Sunrisers Hyderabad | Royal Challengers Bangalore | 1 | 4 | DA Warner | S Dhawan | TS Mills | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | NaN | NaN | NaN |
| 4 | 1 | 1 | Sunrisers Hyderabad | Royal Challengers Bangalore | 1 | 5 | DA Warner | S Dhawan | TS Mills | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 2 | 2 | NaN | NaN | NaN |
| 5 | 1 | 1 | Sunrisers Hyderabad | Royal Challengers Bangalore | 1 | 6 | S Dhawan | DA Warner | TS Mills | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | NaN | NaN | NaN |
| 6 | 1 | 1 | Sunrisers Hyderabad | Royal Challengers Bangalore | 1 | 7 | S Dhawan | DA Warner | TS Mills | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 1 | 1 | NaN | NaN | NaN |
| 7 | 1 | 1 | Sunrisers Hyderabad | Royal Challengers Bangalore | 2 | 1 | S Dhawan | DA Warner | A Choudhary | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | NaN | NaN | NaN |
| 8 | 1 | 1 | Sunrisers Hyderabad | Royal Challengers Bangalore | 2 | 2 | DA Warner | S Dhawan | A Choudhary | 0 | 0 | 0 | 0 | 0 | 0 | 4 | 0 | 4 | NaN | NaN | NaN |
| 9 | 1 | 1 | Sunrisers Hyderabad | Royal Challengers Bangalore | 2 | 3 | DA Warner | S Dhawan | A Choudhary | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 1 | 1 | NaN | NaN | NaN |
Last rows
| match_id | inning | batting_team | bowling_team | over | ball | batsman | non_striker | bowler | is_super_over | wide_runs | bye_runs | legbye_runs | noball_runs | penalty_runs | batsman_runs | extra_runs | total_runs | player_dismissed | dismissal_kind | fielder | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 193394 | 1237181 | 2 | MI | DC | 18 | 1 | KA Pollard | Ishan Kishan | K Rabada | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | KA Pollard | bowled | NaN |
| 193395 | 1237181 | 2 | MI | DC | 18 | 2 | HH Pandya | Ishan Kishan | K Rabada | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | NaN | NaN | NaN |
| 193396 | 1237181 | 2 | MI | DC | 18 | 3 | Ishan Kishan | HH Pandya | K Rabada | 0 | 0 | 0 | 0 | 0 | 0 | 4 | 0 | 4 | NaN | NaN | NaN |
| 193397 | 1237181 | 2 | MI | DC | 18 | 4 | Ishan Kishan | HH Pandya | K Rabada | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | NaN | NaN | NaN |
| 193398 | 1237181 | 2 | MI | DC | 18 | 5 | HH Pandya | Ishan Kishan | K Rabada | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | NaN | NaN | NaN |
| 193399 | 1237181 | 2 | MI | DC | 18 | 6 | HH Pandya | Ishan Kishan | K Rabada | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | NaN | NaN | NaN |
| 193400 | 1237181 | 2 | MI | DC | 19 | 1 | HH Pandya | Ishan Kishan | Anrich Nortje | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | NaN | NaN | NaN |
| 193401 | 1237181 | 2 | MI | DC | 19 | 2 | Ishan Kishan | HH Pandya | Anrich Nortje | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | NaN | NaN | NaN |
| 193402 | 1237181 | 2 | MI | DC | 19 | 3 | HH Pandya | Ishan Kishan | Anrich Nortje | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | HH Pandya | caught | AM Rahane |
| 193403 | 1237181 | 2 | MI | DC | 19 | 4 | KH Pandya | Ishan Kishan | Anrich Nortje | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | NaN | NaN | NaN |